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The magnetic diagnostics subsystem of the LISA Technology Package (LTP) on board the LISA 
PathFinder (LPF) spacecraft includes a set of four tri-axial fiuxgate magnetometers, intended to 
measure with high precision the magnetic field at their respective positions. However, their readouts 
do not provide a direct measurement of the magnetic field at the positions of the test masses, and 
hence an interpolation method must be designed and implemented to obtain the values of the 
magnetic field at these positions. However, such interpolation process faces serious difficulties. 
Indeed, the size of the interpolation region is excessive for a linear interpolation to be reliable while, 
on the other hand, the number of magnetometer channels does not provide sufficient data to go 
beyond the linear approximation. We describe an alternative method to address this issue, by means 
of neural network algorithms. The key point in this approach is the ability of neural networks to learn 
from suitable training data representing the behavior of the magnetic field. Despite the relatively 
large distance between the test masses and the magnetometers, and the insufficient number of data 
channels, we find that our artificial neural network algorithm is able to reduce the estimation errors 
of the field and gradient down to levels below 10%, a quite satisfactory result. Learning efficiency 
can be best improved by making use of data obtained in on-ground measurements prior to mission 
launch in all relevant satellite locations and in real operation conditions. Reliable information on 
that appears to be essential for a meaningful assessment of magnetic noise in the LTP. 

PACS numbers: 02.60.Ed, 02.90.+p, 07.05.Mh, 07.05.Fb, 07.87.+V, 04.30.-w, 04.80.Nn, 06.30.Ka 



I. INTRODUCTION 

LISA Pathfinder {LPF) is a science and technol- 
ogy demonstrator programmed by the European Space 
Agency (ESA) within its LISA mission activities [1]. 
LISA (Laser Interferometer Space Antenna) is a joint 
ESA-NASA mission which will be the first low frequency 
(milli-Hz) gravitational wave detector, and also the first 
space-borne gravitational wave observatory. LPPs pay- 
load is the LISA Technology Package (LTP), and will be 
the highest sensitivity geodesic explorer flown to date. 
The LTP is designed to measure relative accelerations be- 
tween two test masses (TM) in nominal free fall (geodesic 
motion) with a noise budget of 
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in the frequency band between 1 mHz and 30 mHz [2]. 

Noise in the LTP arises as a consequence of various dis- 
turbances, mainly generated within the spacecraft itself, 
which limit the performance of the instrument. A num- 
ber of these disturbances are monitored and dealt with by 
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means of suitable devices, which form the so-called Diag- 
nostics Subsystem [3 . In LPF, this includes thermal and 
magnetic diagnostics, plus the radiation monitor, which 
provides counting and spectral information on ionizing 
particles hitting the spacecraft. The magnetic diagnos- 
tics system will be the subject of our attention here. 

One of the most important functions of the LTP mag- 
netic diagnostics is the determination of the magnetic 
field and its gradient at the positions of the TMs. For 
this, it includes a set of four tri-axial fiuxgate magne- 
tometers, intended to measure with high precision the 
magnetic field at the positions they occupy in the space- 
craft — see figure [T] Their readouts do not however pro- 
vide a direct measurement of the magnetic field at the po- 
sitions where the TMs are, and an interpolation method 
must therefore be implemented to calculate it. In the 
circumstances we face, this is a difficult problem, mostly 
because the magnetometers layout is such that they are 
too distant from the locations of the TMs compared with 
the typical scales of the distribution of magnetic sources 
in the satellite. Its solution is however imperative since 
magnetic noise can be as high as 40 % of the total bud- 
get [2 given by Eq. ([T]), and hence it must be properly 
quantified. 

In order to design a suitable interpolation scheme, in- 
formation on the actual distribution of magnetic sources 
is necessary. Data from the spacecraft manufacturer 
(EADS Astrium Stevenage, UK) have kindly been 
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FIG. 1: Artist view of the LPF science-craft. The LCA is in 
the center, surrounded by a double cyUndrical shield. Outside 
it, a number of electronic boxes are represented, most of which 
are sources of magnetic field. The four magnetometers are 
the white little boxes indicated by the arrows (magnetometer 
#3 is however not visible), and are mounted on the outer 
cylindrical shell. 



handed to us [4 for this purpose. According to these 
data, magnetic sources can be characterized as magnetic 
dipoles, whose positions are known and whose magnetic 
moments are only known in modulus — not in orienta- 
tion. Most of these dipoles are associated to electronic 
boxes, with a few genuinely magnetic elements. An ex- 
ception to this rule is the solar panels, which cover the en- 
tire spacecraft and can hardly be considered as a dipole as 
seen by the magnetometers. They are however designed 
so that their cells are arranged to minimize magnetic ef- 
fects by having their rim wires wound contiguous and in 
opposite senses. 

Astrium data are based on system design, so valida- 
tion with the real spacecraft must be done by means of 
experiment, which is of course included in the planned 
activities before launch. Actually, though, the structure 
of the magnetic source distribution and their properties 
will not be directly visible either to the magnetometers or 
to the interpolation algorithms, which will just work with 
magnetic field values no matter how they are generated. 
Nevertheless, we think that the information available so 
far, though not final, qualifies very well as a guide to the 
elaboration of a magnetic model which will be needed 
to define and verify the performance of the analysis al- 
gorithms which will eventually be applied to the data 
delivered by the satellite in flight. 

In this paper we will make use of the dipole model 
of the sources to assess the performance of two different 
types of interpolation methods: multipole interpolation 
and neural network algorithms. The first is the more im- 
mediate one to try, but as we will show below it is not as 
efficient as one might expect a priori. To overcome this 



problem we propose a novel method, based on neural 
networks. Based on the results obtained with the same 
dipole source model, our solution looks promising since 
the errors of the interpolated fields and gradients are sig- 
nificantly smaller than those obtained with the multipole 
approach. The paper is structured as follows. In Sect. 
[TI| we provide a general description of the problem. It 
follows Sect. |III[ where we discuss the multipole interpo- 
lation, whereas in Sect. |IV| we explain our neural network 
approach. The results of applying this algorithm are pre- 
sented in Sect. |V| while in Sect. |VI| we summarize our 
major findings and we draw our conclusions. 



II. GENERAL DESCRIPTION OF THE 
PROBLEM 



Magnetic noise in the LTP is allowed to be a signifi- 
cant fraction of the total mission acceleration noise: 1.2 x 
10~^^ ms~^ Hz~^/^ can be apportioned to magnetism, 
i.e., 40% of the total noise, 3x10"^^ ms-^ Hz-^/^ see 
Eq. This noise occurs because the residual mag- 

netization and susceptibility of the TMs couple to the 
surrounding magnetic field, giving rise to a force 
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in each of the TMs. In this expression B is the mag- 
netic field in the TM, x ^ its magnetic sus- 
ceptibility and density of magnetic moment (magnetiza- 
tion), respectively, and V is the volume of the TM; /io is 
the vacuum magnetic constant, 47r x 10~^ mkgs~^ 
and (• • •) indicates TM volume average of the enclosed 
quantity. Moreover, the magnetic field and its gradient 
randomly fluctuate in the regions occupied by the test 
masses, thus resulting in a randomly fluctuating force: 
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where SB represents the fluctuation of the magnetic field, 
and stands for the fluctuation of the gradient [5 . 

Quantitative assessment of magnetic noise in the LTP 
clearly requires real-time monitoring of the magnetic 
field, which in LPF is done by means of a set of four 
tri-axial fluxgate magnetometers [6]. These devices have 
a high-permeability magnetic core, which drives a design 
constraint to keep them somewhat far from the TMs. The 
price to be paid for this is that the measured field is not 
directly useful (we need to know it at the positions of the 
TMs). Hence, a procedure to estimate it at these posi- 
tions, based on the data delivered by the magnetometers, 
must be set up. 

As previously mentioned, the sources of magnetic field 
are essentially electronics boxes plus a few genuinely mag- 
netic components inside the spacecraft. The interplane- 
tary magnetic field is orders of magnitude weaker, hence 
of little relevance to the effects considered here, and so- 
lar panel effects will not be considered — see section [l| 
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FIG. 2: Conceptual diagram: magnetic sources (green dots, 
size proportional to the modulus of the magnetic moment of 
the source) , Test Masses (red dots) , and the four magnetome- 
ters (black triangles). Also represented (in dark blue) is the 
wall of the LCA. 

There are no sources of magnetic field inside the LTP 
Core Assembly (LCA), all being placed outside its walls. 
The number of Astrium identified sources is around 40, 
and can be modeled as point magnetic dipoles [4 . Fig- 
ure |2] gives an overview of the geometry, see caption for 
details. 



III. MULTIPOLE INTERPOLATION THEORY 

Perhaps the most immediate (and obvious) procedure 
to interpolate the magnetic field is to resort to its mul- 
tipole structure. This is known to be the best option 
in some mathematical sense [7 . Consequently, we first 
describe the details of its implementation, and then we 
assess its practical merit. 

We will treat the LCA region as a vacuum. This is 
a reasonable hypothesis, as the materials inside it are 
essentially no n- magnetic. Accordingly, the magnetic field 
has zero divergence and rotational [TTj: 

V.B(x, t) = V X B(x, t) = (4) 

Since V x B(x, t) = 0, we thus have 

B(x,t) = Vvl/(x,t) (5) 

where ^(x, t) is a scalar function. Additionally, since 
V'B(x, t) = 0, too, it immediately follows that ^(x, t) is 
a harmonic function, or 

V^^(x,t) = (6) 

The solution to this equation can be expressed as an 
orthogonal series of the form 

CO I 

*(x,t) = ^ ^ Mi^(t)r'Fi„(n) (7) 

^=0 m^-l 



where 

r = |x| , n = x/r (8) 

are the spherical coordinates of the field point x, whose 
origin is by (arbitrary) convention assumed in the geo- 
metric center of the LCA. Equation ^ could also con- 
tain terms proportional to r~^~^, but these have been 
dropped because the field cannot diverge at the center of 
the LCA. Actually, the expansion of Eq. ^ is only valid 
in a region interior to the closest field source. Finally, the 
coefficients Mimit)^ which will be called multipole coef- 
ficients in the sequel, depend on the sources of magnetic 
field. 

To obtain the field components we take the derivative 
of Eq. following Eq. (js]): 

CO I 

B(x,t) = V«'(x,t) = ^ Mi^{t)V[r'Yi^{u)] 

l=\ rn= — l 

(9) 

According to standard mathematics, the coefficients 
Mim{t) can be fully determined if the magnetic field is 
known at the boundary of the volume where the field 
equations are considered, in this case the LCA. This data 
is of course not available to us, since we only know B in 
four points of the boundary, where the magnetometers 
are. Therefore the question we need to address is: how 
many terms of the series can we possibly determine on 
the basis of the limited information available? Or, equiv- 
alently, how many multipole coefficients can we estimate, 
given the magnetometers readout data? Then, also, to 
which accuracy can we estimate the actual magnetic field 
after the maximum number of multipole coefficients have 
been calculated? 

The answer to the first question above is actually not 
difficult: let us assume that the series in Eq. ([9| is trun- 
cated after a maximum multipole index value l = L. The 
estimated field. Be, is then given by: 

L I 

Be(x,t) = ^ ^ Mi^{t)V[r'Yi^{Y^)] (10) 

l—\ rn— — l 

The number of terms in this sum is 

L 

N{L) = ^ (2/ + 1) = L(L + 2) (11) 

which obviously equals the number of multipole coeffi- 
cients needed to evaluate the sum. For example, we have 
A/'(2) = 8 and 7V(3) = 15. On the other hand, the num- 
ber of magnetometer data channels is 12 — three chan- 
nels per magnetometer, as the devices are tri-axial. This 
means we cannot push the series beyond the quadrupole 
(1 = 2) terms. This means that since we only have 12 data 
channels we have some redundancy to determine the first 
eight Mim(t) coefficients up to / = 2, though we also lack 
information to evaluate the next seven octupole terms 
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In order to make a best estimate of the Mim{t), a least- 
square method is set up as fohows. Firstly, we define a 
quadratic error: 



(12) 



where Br is the real magnetic field and the sum extends 
over the number of magnetometers, situated at positions 
Xg (5 = 1,. . . ,4). We then find those values of which 
minimize the error: 







(13) 



Once this system of equations is solved, the estimated 
coefficients Mim{t) are replaced back into Eq. (10) and 
then the spatial arguments x substituted by the positions 
of each test mass to finally obtain the interpolated field 
values. This process needs to be repeated for each instant 
t of time at which measurements are taken, thereby gen- 
erating the magnetic field time series. The gradient is 
estimated by taking the derivatives of Eq. (10): 
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It is to be noted that Eq. (10) is a polynomial of de- 
gree L—1 in the space coordinates {x^y^z)^ hence its de- 
gree equals 1 when L = 2. Since this is the most we can 
get of the magnetometer readout channels, the multi- 
pole expansion is actually equivalent to a linear inter- 
polation of the field between its values at the boundary 
of the LCA and its interior. We may therefore not ex- 
pect this method to produce excellent results, simply be- 
cause the magnetic field inside the LCA is weaker than at 
its boundaries, the reason being that the magnetic field 
sources are outside the LCA. This valley structure of the 
magnetic field needs at least octupole (quadratic) terms 
to be approximated, but this would require at least one 
more vector magnetometer, which is not available. By 
the same argument, the field gradient can only be ap- 
proximated by a constant value throughout the LCA — 
see Eq. (III). 



A. Numerical simulations 

In order to have a quantitative idea of the actual per- 
formance of the above interpolation scheme, we make use 
of the source dipole model. It has the following ingredi- 
ents and assumptions: 

1. The sources of magnetic are point dipoles outside 
the LCA. 

2. The sources are those identified by Astrium Steve- 
nage, as already mentioned, whose positions in the 



S/C are known. The set itself, as well as the source 
magnetic parameters need to be updated, but the 
data used (which date back to November 2006) 
qualifies to verify the performance of the interpola- 
tion methods. 

3. The magnetic field created by the dipole distribu- 
tion at a generic point x and time t is therefore 
given by 

T3/^ _ ^0 V" 3[ma(t)»na] Ug - ma{t) 

where 11^= (x — Xa)/|x — x^l are unit vectors con- 
necting the the a-th dipole 111^ with the field point 
X, and n is the number of dipoles. 

4. Fluctuations of the dipoles, both in modulus and 
direction, are unknown, but this is not essential to 
assess the numerical performance of the algorithm. 

We aim to compare interpolated magnetic field results 
with exact ones within the context and scope of the above 
model. To artificially simulate several possible scenarios, 
we will take advantage of the uncertainties in the source 
dipole orientations to randomly generate different mag- 
netic field patterns, which we intend to reconstruct based 
on the multipole expansion. More specifically, the proce- 
dure is the following one: 

1. Each dipole has a known fixed position in the space- 
craft, and a fixed modulus, also known. The num- 
ber of magnetic dipoles is also fixed to 37, which is 
the number in Astrium's list. 

2. The orientations of the dipoles are instead un- 
known. An example scenario is characterized by 
a specific selection of the 37 dipole orientations. 

3. In order to explore the behavior of the algorithm, a 
batch of examples are examined, each correspond- 
ing to a randomly generated set of dipole orienta- 
tions. 



4. In each case, Eq. (13) is solved for M^^, and the 



field estimate at each TM is then calculated with 
Eq. (10). In the last step, the result is compared 



with the theoretical one given in Eq. (15), and the 
differences annotated. 

5. Finally, a statistical analysis of the differences (er- 
rors) is done. 

The random character of the procedure may seem un- 
realistic, since the actual satellite configuration is not 
random. In this context, however, randomness is an ef- 
ficient way of mimicking lack of knowledge. As we will 
see in the next section, numerical analysis based on this 
methodology sheds much light on the merits of the inter- 
polation procedure — as it will also be the case when we 
come to neural networks performance in section [V| 
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1. Simulation results 

In this section we summarise the most relevant results 
of the analysis of the multipole interpolation method. We 
use a batch of 1 000 example scenarios such as described 
above. Magnetic moment orientations were chosen by 
randomly picking values of the two defining spherical an- 
gles if) from two independent uniform distributions. 

Figure [3] graphically represents a magnetic field map 
in the LCA region corresponding to an arbitrarily chosen 
example out of the 1 000 considered. The valley struc- 
ture is very clear in the |B| plot, while the compo- 
nent shows a saddle shape — see figure caption. By and 
Bz show qualitatively similar forms, and thus we do not 
show them. The elliptical forms in the estimate of |B| 
are due to the quadratic combination of the field compo- 
nents. The estimate of Bx shows instead a linear struc- 
ture, with constant gradient in all directions. Naked eye 
inspection immediately reveals a poor resemblance be- 
tween estimated and exact quantities, but let us elabo- 
rate some numerical data. 

Figure [4] displays the binned distribution of estimation 
errors, defined by 



|Bh = 



|Br| 



B:r 



Br I 



(16) 



where we have used a denominator |Br| in e{Bx) to avoid 
meaningless infinities when Bx is close to zero. By and 
Bz show similar trends and are not displayed. As can 





FIG. 3: Magnetic field contour plots in the LCA region for a 
given source dipole configuration, x is the direction between 
the two TMs, and y is in the "horizontal" plane, which in 
the plot is at the TMs centres of mass altitude. Left panels: 
moduli of the magenetic field. The top panel displays the 
exact one, and the bottom one shows its multipole estimate. 
Right panels: same as in the left panels, but for one of the 
field components (Bx). The modulus of the magnetic field 
shows a complex structure in the central area, while Bx has a 
saddle structure there — see along the diagonals of the frame 
graph. The red dots mark the centers of the TMs, and the 
black triangles the positions of the magnetometers. 
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FIG. 4: Binned errors of the estimated modulus of the mag- 
netic field and of its x component. They are reported in 
percentage. Colors correspond to each of the LTP TMs, re- 
spectively. Inset values of a indicate the rms half- width of the 
distributions. Solid lines are Gaussian fits to the histograms. 



TABLE I: Averaged absolute value of the estimation errors 
in the components of the magnetic field and of its modulus. 
They are reported in relative percent. 
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be seen, errors average to zero, but have rms deviations 
well above 100 %. Even worse, outliers are significant, as 
can be seen in Table ^ where averaged absolute values 
over the 1000 simulated cases are displayed. Except, 
obviously, for the modulus error, we are around 500%, 
but detailed examination of individual data further shows 
that errors as high as 2 000 % eventually happen. 

The most salient features of the numerical analysis can 
be briefiy summarized. Firstly we find that magnetic field 
estimation errors are very variable, ranging from very few 
percent to over 1 000 % and, secondly, these huge uncer- 
tainties happen in an utterly random and fully unpre- 
dictable way. The a posteriori conclusion is quite simple: 
the intrinsic linear character of the interpolation scheme 
is not capable of reproducing the field structure inside 
the LCA — hence at the positions of the TMs — and, 
therefore, can produce very good or very bad results just 
by accident. In addition to not being predictable, the 
average error is any case too large. The ultimate rea- 
son for such poor performance is the the small number 
of magnetometers as well as their positioning: four mag- 
netometers only allow for a field multipole expansion up 
to quadrupole terms, which means that the field values 
at the TMs are just linearly interpolated between mag- 
netometer readouts at the boundary of the LCA. On the 
other hand, the magnetometers are closer to the magnetic 
field sources than they are to the TMs, which prevents 
resolution of the spatial field structure details inside the 
LCA with only linear terms in the space coordinates. 
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FIG. 5: Feed-forward neural network architecture. Magnetometers readings are the system inputs, and estimates of the field 
and gradient at the positions of the test masses are the outputs of the system. In this architecture, only one intermediate, or 
hidden layer is assumed. Each of the circles represents one neuron and corresponds to the model of Eqs. (17) and (18). 



IV. A NOVEL APPROACH: NEURAL 
NETWORKS 

Search for an alternative approach to the above in- 
terpolation schemes is imperative, otherwise the infor- 
mation provided by the magnetometers will hardly be 
useful for the main goal of the LTP magnetic diagnostics 
system, i.e., to quantify the contribution of the magnetic 
noise to the total system noise. Here some promising re- 
sults are presented on the implementation of a completely 
different methodology: neural networks [8 . 

Artificial neural networks are made up of intercon- 
necting artificial neurons (programming constructs that 
mimic the properties of biological neurons) that have the 
capacity to learn from processing data. Neural networks 
are often used in solving nonlinear classification and re- 
gression tasks by learning from data, hence are worth 
trying with the present problem [9]. 

There are four sets of tasks which need to be imple- 
mented when solving a problem with artificial neural net- 
works: 

1. Neuron model selection 

2. Model and architecture selection 

3. Learning paradigm and learning algorithm selec- 
tion 

4. Performance assessment 

We next go through them, one by one. 



A. Neuron model 

The neuron is the basic unit of any neural network. It 
performs the following two operations: 

• It collects the inputs from all other neurons con- 
nected to it and computes a weighted sum of the 
signals the latter inject into it, generally adding 
a bias as well. If we represent the inputs by 
a vector x = (xi, . . . , Xn), and the weights by a 
w = (xi, . . . , Wn) then this operation consists in cal- 
culating the sum 



Wo ■ 



n 

E 



(17) 



where the superindex T stands for transpose ma- 
trix; in this case, w-^ is a row vector while x is a 
column vector, so that w^x is the scalar product 
of w and x. A term wq is added to form the most 
general linear function of the vector argument x; it 
is called the bias. 

• The above sum is used as the argument to the so- 
called activation function, (p{Ti). The neuron's out- 
put, also known as its activation, is thus 



(18) 



In general, (p{Ti) can be selected in many differ- 
ent ways. Here, different iable activation functions 
will be used, which suit well the gradient descent 
back-propagation learning algorithm — see sections 
below. 
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B. Neural network architecture 

Artificial neural networks are software or hardware 
models inspired by the structure and behavior of bio- 
logical systems, and they are created by a set of neurons 
distributed in layers. There are many different types of 
neural networks in use today, but the architecture of a so- 
called feed-forward network, where each layer of neurons 
is linked with the next by means of a set of weights, is 
the most commonly used, and will also be used here. The 
specific architecture adopted in this study is shown in 
figure |5j The data streams coming from the magnetome- 
ters will be considered the system inputs, while magnetic 
field results and their gradients at the positions of the 
test masses will be the system outputs. 



C. Learning paradigms and learning/training 
algorithms 

The investigation of learning algorithms is currently an 
active field of research. The design and implementation 
of an adequate training scheme is the essential ingredi- 
ent for obtaining a good-quality estimate of the magnetic 
field and its gradient at the LTP TMs. 



1. Learning paradigms 

There are two major learning paradigms, each corre- 
sponding to a particular abstract learning task. These 
are supervised learning and unsupervised learning. 

1. Supervised learning. The idea of this paradigm is 
quite clearly suggested by its very name. A set of 
examples is filed, each set consisting in a number 
of vector of inputs (the magnetometers' readouts 
in this case) and the corresponding values of the 
magnetic field and its gradient at the TMs for a 
given distribution of dipoles in the spacecraft. Let 
X represent a generic input vector, and y the asso- 
ciated vector output. These two vectors constitute 
an example. The set of filed examples for super- 
vised learning is thus a set of pairs (x,y), where 
xgX and yGF, X and Y being some suitable sam- 
ple spaces. The network is then fed the inputs x of 
one example and let it work out an output, o, say. 
This output is then compared with the correct one, 
y, and an error is calculated if o^y. Iterations 
are then triggered to adjust the weighting factors 
such that the error is minimized. These will how- 
ever vary as different examples are run, so a cost 
function is defined which enables the network to op- 
timise the set of weights which works best for the 
set of examples analyzed, based on some suitable 
criterion. 

2. Unsupervised learning. In unsupervised learning a 
cost function is to be minimized as well, but this 



function can be any relationship between x and 
the network output, o, but never taking into ac- 
count the real expected target. The cost function 
is determined by the task formulation. Unsuper- 
vised learning is thus a form of self-adaptive sys- 
tem, whose guide is not an a priori knowledge of 
the final result but knowledge gained from experi- 
ence. 

In either case, the learning process is based on the ar- 
chitecture of the network, i.e., number of neurons and 
layers and their interconnections, as well as on the acti- 
vation functions. These are parameters which, at least in 
the simplest cases, are tuned ab initio by the user based 
on observed performance of the network. In this study, 
supervised learning has been the implemented learning 
paradigm. 



2. Learning algorithms 

There are many algorithms for training neural net- 
works. When training feed- forward neural networks with 
supervised learning, a back-propagation algorithm is usu- 
ally implemented. The error of the mapping at the out- 
put is propagated backwards in order to readjust the 
weights and improve the output error for the next iter- 
ation. The propagation can be implemented with differ- 
ent methods, the Ideal Gradient Descent being a classic 
which will also be used here, with slight modifications 
that make the algorithm convergence faster. 

Iterations on the weights of the different neurons at 
the different layers proceed according to the following 
algorithm: 



V 



dw 



(19) 



where n labels the current iteration step, and rj is the 
learning rate, adjustable by the user. E is the sum over 
the set of training examples of the square errors of the 
outputs: 



E = ^(o-yr(o-y) 



(20) 



where s stands for the number of examples, o is the 
(vector) output from the network, while y is the target, 
or correct output in the corresponding example. The 
quantity E can only be defined in supervised learning, of 
course, and the idea of the above procedure is to find that 
point in weight space where E is an absolute minimum. 
E can therefore be considered the cost function to be 
minimized in this particular supervised training scheme, 
also known as batch mode as the analysis is done across 
the entire set of training patterns in a single block. 

There are a number of technical issues in pursuing the 
iterations in Eq. (19), such as the choice of the initial 



set of weights, the identification of local minima of E^ 
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FIG. 6: Error distributions for each field component at tfie 
position of test mass 1 (black line) and test mass 2 (red line) . 
The top left panel displays the results for Bx, the top right 
panel shows the error distribution for By, while the bottom 



left panel depicts the distribution obtained for Bz 
bottom right panel that for |B|. 



and the 



the boundary effects,. .. which need to be addressed in 
each specific case. We skip a detailed discussion of these 
matters here and we focus on the results obtained using 
our method. For further details, the reader is referred to 
Refs. iSj and 



D. Performance assessment 

In this last step, the trained network must be tested 
with examples which differ from those used in the learn- 
ing process. This is needed to assess whether or not the 
trained neural network is able to generate the expected 
results when fed with previously unseen inputs, hence 
determine its usability for the specific purpose it is in- 
tended. 



V. RESULTS 

Training and testing have been done based on differ- 
ent field realizations, using the same model of sources 
and magnetic field described in section III A[ i.e., each 
example will consist in the magnetic field at the magne- 
tometers' positions, plus the magnetic field and gradient 
at the TM positions, all of them corresponding to a given 
configuration of the 37 Astrium dipoles. 

Two different batches of examples, each including 1 000 
realizations of a possible magnetic environment, have 
been generated following the directives explained in sec- 
tion [TTrAj The first batch has been used as the training 
set for a neural network with 12 inputs (3 inputs for each 
of the 4 vector magnetometers) and 16 outputs represent- 
ing the field information at the position of the two test 
masses (3 field plus 5 gradient components per test mass 
[13^). The second batch has been used for validation to 
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FIG. 7: Correlation coefficient between the information enter- 
ing the network (each magnetometer input) and the outputs 
provided by the trained network for the field estimates. 



assess the performance of the network in front of unseen 
magnetometers readings. 



A. Field estimation 

Figure [6] shows the distribution of relative errors (in 
percentage) of the estimated components of the magnetic 
field at the positions of each TM. The plot is based on the 
results of the 1 000 validation runs described in the previ- 
ous section. As can be observed, the order of magnitude 
of the errors of the estimated fields are now within much 
more acceptable margins (below ~ 15%). This represents 
a reduction of estimation errors of more than one order of 
magnitude in comparison with the multipole expansion 
method. 

During the training process, the neural network even- 
tually learns that the magnetic field at the TMs is gener- 
ally smaller than the magnetometers read — with occa- 
sional exceptions due to the rich and complex structure 
of the field inside the LCA, see e.g. figure [3j The neural 
network is able to derive an inference procedure which is 
actually quite efficient, and it does so by proper adjust- 
ment of its weight matrix coefficients w as explained in 
section [TVC II In order to better understand the reac- 
tion of the trained neural network to the magnetometers' 
data, we found instructive and expedient to look into re- 
lationships between the data read by the magnetometers 
and the magnetic field estimates generated at the output 
of the neural network. We chose to calculate correla- 
tion coefficients between input and output data, and the 
results are displayed in Figure [T] The following major 
features are identified: 

• Each component of the field is basically estimated 
from the magnetometers reading of the same com- 
ponent. For example, the interpolation of the Bx 
component in test mass 1 is mostly dependent on 
the Bx readings of the magnetometers. 
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FIG. 8: Probability density function of the errors distribu- 
tion for the three components of Vi^x- From top to bottom: 
dBx/dx^ dBx/dy and dBx/dz at the positions of the test 
masses. Errors are given in percentages, the black lines cor- 
responding to TMl, and the red ones to TM2. 



• The measurements of the magnetometers closer to 
the interpolation points have larger weights. For 
instance, when the field is estimated at the position 
of TMl, to which M4 is the closest magnetometer, 
the value it measures is the largest contributor to 
the interpolated field in TMl. At the same time. 
Ml and M3 being nearly equidistant from both test 
masses, their weights are almost identical. 



B. Gradient interpolation 

The magnetic field gradient can also be estimated. The 
9 components dBi/dBj of the gradient are not indepen- 
dent, since they must verify Eqs. Q, which reduce their 
number to 5. The remaining 4 components can be eas- 
ily calculated thereafter. Another option is to estimate 
the 9 gradient components regardless of the previously 
mentioned constraint, in which case they are actually 
found not to satisfy them. Discrepancies are however 
within the estimation error range, so we do not adopt 
this option here as it is slightly more cumbersome due to 
the correspondingly increased complexity of the network. 

Results on gradient estimation are shown in Figure [S] 
for ^Bx at the positions of both TMs. As can be ob- 
served, they are also within much more acceptable mar- 
gins than the earlier interpolation approach could possi- 
bly produce. It is to be noted that no apparent or easily 
deductible physical relationship is found between the es- 
timated gradient at the test mass positions and the mag- 
netometer inputs, in contrast with what we have found 
for the field estimation. 



C. Statistical analysis 

In Table [ll] we present a statistical comparison of the 
properties of the distribution of interpolated magnetic 



TABLE II: Statistical properties of the distribution of errors 
of the interpolated magnetic field. 
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1.5204 


-0.0028 
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-0.1014 
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fields. For the sake of conciseness we only list the sta- 
tistical properties of the interpolated modulus and x- 
component of the magnetic field. In particular, we show 
the standard deviation (a) of the interpolating errors for 
both the multipole interpolation and the neural network 
estimate, the skewness of the distribution (71) and the 
corresponding kurtosis (72). Clearly, and as already men- 
tioned, the interpolating errors are very large for the case 
in which a multipole interpolating scheme is used, as 
clearly shown by the very large standard deviation ob- 
tained when using this method. Also interesting to note 
is that for the case of the x-component of the magnetic 
field both methods yield distributions which are almost 
symmetrical. However this is not the case for the modu- 
lus of the magnetic field when the multipole interpolat- 
ing method is used. Finally, the kurtosis of the multipole 
interpolation is very large, revealing a large number of 
outliers. All in all, a look at Table [ll] reveals that the 
neural network method presents much better statistical 
properties than the multipole interpolation. 



VI. CONCLUSIONS 

The magnetic diagnostics sensor set in the LTP is such 
that to infer the magnetic field and gradient at the po- 
sitions of the TMs based on the readouts of the mag- 
netometers is far from simple. The more standard in- 
terpolation scheme, based on a multipole expansion of 
the magnetic field inside the LCA volume, cannot go be- 
yond quadrupole order which, in practice, means that 
just a linear approximation can be done, due to the re- 
duced number of magnetometers available. This grossly 
fails to produce reliable results, with errors exceedingly 
large. This has motivated our search for better alterna- 
tives. Artificial neural networks have been presented as a 
more elaborate, non-linear procedure to estimate the re- 
quired field values at the TM positions. In this paper we 
have presented results which very significantly improve 
the performance of the multipole expansion technique by 
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almost two orders of magnitude. This a very encouraging 
outcome which points to the use of the neural networks 
as the baseline tool to analyse LTP magnetic data. 

One of the main problems of using the neural network 
to assess the magnetic field at the positions of the test 
masses is to find a training process adequate to the set 
of data that the magnetometers will deliver in fiight. 
This underlines the need to characterize on ground to 
our best ability the magnetic field distribution across the 
LCA for as many as possible foreseeable working con- 
ditions, both regarding DC and fiuctuating values. Re- 
liable information on this is essential for a meaningful 
assessment of magnetic noise in the LTP. However, the 
neural network analyses presented in this paper only ap- 
ply to static fields. What they actually show is that 
neural networks work very well (~ 10 % accuracies) no 
matter which the source dipole configuration is. A dif- 
ferent issue, which is beyond the scope of this paper and 
it is currently under investigation, is how to deal with 



time series of magnetometer readouts, which is of course 
the kind of data the satellite will transmit to ground. 
Features such as trends, field fiuct uat ions,. .. will likely 
happen during mission operations, and the neural net- 
work algorithm must be trained to properly deal with 
them. Preliminary results indicate that the network is 
able to deal with moderate trends and levels of fiuctua- 
tions, but further effort is needed to explore alternatives, 
e.g. self- adaptability, which will make more robust the 
performance of the system. 
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